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(57) Abstract 



Web browser software (3!) is implemented 
in a processing system housed in a set-tt^ box 
(10) cmnected to a television and communi- 
cating over a wide-narea network with one or 
more serven (4). The tnowser allows a user 
to use a remote control device (1 1) to navigate 
between hypertcju anchors m World-Wide-Web 
pages (100) displayed on the television (12). Tbc 
browser is caf^ble of downloading audio files 
frnn Web sites and generating sound thrmi^ a 
speaker of die television set (12) based on dx 
audio files. Background music b generated dur- 
ing Web browsing based on audio files download 
from one of the servers (5) or audio files stored 
in memory (23) in the set-top box. 
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User Interface for Controlling Audio Functions in a Web Browser 

Hie present application is a continuatioii-in-part of U.S. patent application 
having application no. 08/660088, filed on June 3, 1996. 

FIELD OF THE INVENTION 

The present invention pertains to the field of browser software for 
browsing the World Wde Web. Moie particulariy, the present invention relates to 
audio fiinctions in a Web browser. 
BACKGROUND OF THE INVENTION 

The number of homes and businesses using personal computers has 
increased substantially in recent years, and along with this increase has come an 
explosion in the use of the Internet, and panicularly the Worid-Wde Web ("die 
Web"). The Web is a collecti(»i of formatted hypertext pages located on numerous 
computers around the world that are logically connected by the Internet Aldioug^ 
"the Web" has in the past been a source of primarily scientific and technical 
information, it is now a valuable resource for information relating to almost any 
subject, including business, entertainment, travel, and educaticm, to name just a 
few. Advances in network technology, and especially in software such as "^eb 
browsers" (software applications which provide a user interface to the Web), have 
made the Web accessible to a large segment of the population. However, despite 
the growth in die development and use of the Web, many people are still unable to 
take advantage of diis inqxjrtant resource. 

Access to the Web has been limited to people who have access to a 
personal conq)uter. Yet some people cannot afford the cost of even a relativdy 
inexpensive personal computer, while others are unable or unwilling to learn the 
basic conq)uter skills that are required to access the Web. Furthenmore, Web 
browsers in the prior art generally do not provide die degree of user-fiiendliness 
desired t^y some people who lade computer experience, and many computer 
novices do not have the patience to learn how to use die software. Some people, 
for exan^le, dislike an interface which requires diem to identify hypertext objects 
and use die point-and-dick technique to browse dnough Web i^ges. 

Most people, however, fed quite comfortable using a remote control to 
operate a television set Therefore, it would be desirable to allow a person to 
access die Woiid-Wide Web witfiout die use of a personal con^ter. In particular. 
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it would be desirable for a person to be able to access and navigate through Web 
pages using an ordinary television set and a remote control. It would further be 
desirable to have a user interface by which a person can use a remote control to 
navigate between hypertext objects on a Web page with minimal effort or thought, 
so that a person feels more as if he or she is simply changing television channels 
rather than utilizing a complex computer netwodc 

Another problem associated with Web browsers is that they tend to provide 
minimal audio functionality. Audio files linked to Web pages are becoming 
increasingly more common. Although some Web browsers may provide 
capability to download and play audio files, they generally do not provide 
c^ability to manipulate audio data to any significant extent. In addition, Web 
browser user interfaces generally are not set up to facilitate audio functionality. 

SUMMARY OF THE INVENTION 

A method is provided of processing audio information in a client 
processing system that is capable of being coupled to a server processing system 
over a netwoik and is capable of being coupled to an audio ou^ut device. In the 
method, a first audio sequence is output via the audio output device based on a 
first audio file stored in the client processing system. A seccmd audio file, which 
represents a second audio sequence that is to be output following output of the first 
audio sequence, is downloaded into the client processing system fi*om the network 
while the first audio sequence is being output The second audio file is transmitted 
fiom the server processing system to the client processing system over the 
network. 

Anotfier aspect of the present invwition is a method of providing a user 
interface for controlling audio functions of a client processing system. The client 
processing system is coupled to a server processing system over a network and to 
a display device. In the method, audio information is downloaded from the 
network, and an audio control panel is displayed on die display device. The audio 
control panel includes a number of images, each of which is for activating one of a 
number of audio functions for manipulating the downloaded audio information. 

Other features of die present invention will be apparent from the 
accompanying drawings and ftom the detailed description which follows. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Hie piesent inventioii is illustrated by way of example and not limitation in 
the figures of the accompanying drawings, in which like references indicate similar 
elements and in which: 

Hgurc 1 A illustrates several WebTV client systems connected to a WebTV 
server system. 

Figure IB illustrates a WebTV client sysieia 

Rgme IC is a block diagram of an electronics unit of a WebTV client 
system. 

Hgure 2A illustrates the functional relationship between hardware and 
software in the client processing system of Figure !. 

Ingure 2B is a block diagram of the Web browser application software of 
Figure 2B. 

Rgure 3 illustrates a remote control for controlling the Web browser. 

Hgures 4A and 43 illustrate a displayed Web page ccmtaining a nimiber of 
hypertext anchors. 

Figure 5 is a flow diagram illustrating a procedure for receiving and 
processing of an ITTML document according to the present invention. 

Rgure 6 illustrates several hypertext anchors as displayed on a display 

device. 

Figures 7 A and 7B are flow diagrams illustrating the procedure of 
movenoent of a selection between two hypertext anchors in the downwaid direction 
using the remote control of Figure 3. 

Figures 7C and 7D are flow diagrams illustrating the procedure of 
mavoBcat of a selection between two hypertext anchors in the upward direction 
using die remote control of Rgure 3. 

Rgures 8A illustrates an Opdons panel overlaid on a displayed Web page. 

Rgures SB illustrates an Options panel with an Audio pand overlaid cm a 
displayed Web page. 

Rgure 9 is a flow diagram illustrating a routine for displaying an Options 

panel 

Rgure 10 is a flow diagram illustrating a routine for enabling or disabling 
display of an Audio panel. 
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Figures 1 1 A and 1 IB are flow diagrams iUustrating routines for 
downloading and playing background music in a client system. 

DETAILED DESCRIPTrON 

A noethod and apparatus for providing a user interface to control audio 
functions in a Web browser are described. In the following description, for 
purposes of explanation, numerous specific details are set forth in order to provide 
a thorough understanding of the present invention. It will be evident, however, to 
one skilled in the art that the present invention may be practiced without these 
specific details. In other instances, well-known structures and devices are shown 
in block diagram form in order to facilitate description. 

In one embodiment, steps according to the present invention are embodied 
in machine-executable software instructions, and the present invention is carried 
out in a processing system by a processor executing the instructions, as will be 
described in greater detail below. In other embodiments, hardwired drcuitiy may 
be used in place of, or in combination with, software instmctions to implement the 
presoit invention. 

The present invention relates to a system in which a set-top box is 
connected to a television and to one or more servers over the Intemet The set-top 
box includes a processing system that executes browser software to enable a user 
to browse through World-Wide Web pages displayed on the television using a 
remote control device. As wiU be described below, the browser provides a user 
intof ace that includes an audio panel for controlling audio functions related to 
audio files downloaded from the Web. In addition, the Web browser provides a 
continuous stream of background music to the user during Web browsing by 
downloading audio files from a server while other audio is being played to the 
user. 

In one embodiment, the present invention is included in a system known as 
WebTV'"^' (WebTV), which uses a standard television set as a display device for 
browsing the Web and which connects to a conventional network, such as the 
Internet, using standard telephone, ISDN, or similar communication lines. In 
accordance with the present invention, a user of a WebTV cliait system can utilize 
WebTV network services provided by one or more renx>te WebTV servers. The 
WebTV network services are used in conjunction with software luiming in a 
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WebTV client system to browse the Web» send electronic mail, and to make use of 
the Internet in various other ways. The WebTV servers fiincdon as proxies by 
retrieving, firom a remote server, Web pages or other data requested by a WebTV 
client system and then transmitting the requested information to the WebTV client 
system. 

1. System Overview 
Figure 1 A illustrates a configuration of a WebTV network according to one 
embodiment A number of Wd>TV chents 1 are coupled to a modem pool 2 via 
direct-dial, bi-directional data connections 29, wtdch may be telephone (POTS, 

1. e., "plain old telephone service"), ISDN (bitegrated Services Digital Network), 
or any other similar type of connection. The modem pool 2 is coupled typically 
through a router, such as that conventionally known in the art, to a niunber of 
remote servers 4 via a conventional network infirastructure 3, such as the Internet 
The WebTV system also includes a WebTV server 5, wtach specifically supports 
die WebTV clients 1. That is, the server 5 provides the WebTV services to die 
WebTV clients 1. Each of the clioits 1 can connect to the server 5 either through 
direa telephone or ISDN OHinection or through the Internet 3 via the modem pool 

2. Note that the nKxiem pool 2 is a conventional modem pool, such as those 
found today throughout the world providing access to the Internet and private 
networks. Modem pool 2 may be provided by a local Internet Service Provider 

asp). 

The WebTV server 5 generally includes one or more conventional 
coiiq)uter systans. The server 5 may actually conqnise multiple physical and 
logical devices connected in a distributed arcMtecttire. 

Hgure IB illustrates a WebTV client system 1 according to one 
embodiment The c^ent system 1 includes an electronics unit 10 (herdnafter 
refored to as "the WebTV box 10" or "die box 10"), an ordinary television set 12, 
arni a hand-held renK)te control 1 1. In an alternative embodiment of the present 
invention, the WebTV box 10 is built into the television set 12 as an integral unit 
TTie box 10 includes hardware and software far providing the user witii a graphical 
user interface, by which the user can access the WebTV network services, browse 
the Web, send e-mail, and otha:wise access the Internet 



WO-98/22889 



PCT/US97/2117i 



-6- 

The client system 1 uses the television set 12 as a display device and an 
audio output device. The box 10 is coupled to the television set 12 by a link 6. 
The link 6 includes an audio channel for generating sound from the television's 
speakCT and an RF (radio frequency), S-video, composite video, or other 
equivalent form of video channel. The communication link 29 between the box 10 
and the server 5 is either a telephone (POTS) connection 29a or an ISDN 
connection 29b. The box 10 receives AC (alternating current) power through an 
AC power line 7. 

Remote control 1 1 is operated by the user in order to control the client 
system 1 to browse the Web, send e-mail, and perform other Internet-related 
fiincticHis. The box 10 receives commands from remote feontrol 1 1 via an infrared 
(IR) communication Unk. In alternative embodiments, the link between the remote 
control 1 1 and the box 10 may be RF or any equivalent naode of transmission. 

The box 10 includes application software which, when executed by a 
processor in the box 10, provides the user with a graphical user interface by which 
the user can access the WebTV network services and browse the Web. The 
application software is automatically executed upon application of power to the 
box 10. 

Figure IC shows die internal cotiqx)nents of the box 10. Operation of the 
client system 1 is controlled by a CPU 21, which is coupled to an Application- 
Specific Integrated Circuit (ASIQ 20. The CPU 21 executes software designed to 
implement features of the present invention. ASIC 20 contains circuitry which is 
used to inq)lement certain funcdons of the WebTV system. ASIC 20 is coupled to 
an audio digital-to-analog converter 25 which provides audio ou^ut to television 
IZ In addition, ASIC 20 is coupled to a video encoder 26 which provides video 
output to television set 12. An IR interface 24 detects IR signals transmitted by 
remote control 1 1 and, in response, provides coiresponding electrical signals to 
ASIC 20. A standard telephone modem 27 and an ISDN modem 30 are coupled to 
ASIC 20 to provide connections 29a and 29b, respectively, to the nKxiem pool 2 
and, via die Internet 3, to die remote servers 4. Note that, while die illustrated 
embodiment includes both a telephone modem 27 and an ISDN modem 30. eidier 
one of these devices will suffice to practice the present invention. Further, in 
various otha- embodiments, the telephone modem 27 and the ISDN modem 30 
each may be rqilaced by or supplemented witii odier communications devices. 
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such as a cable television modem. In addition, in other embodiments, 
communication widi the server 5 might be made via a token ring or.Ethemrt 
connection. Note that the box 10 also may include a cable televisi(Hi modem (not 
shown). 

Also coupled to ASIC 20 is mask Read-Only Memory (ROM) 22a, a flash 
memory 22b, and a Random Access Memory (RAM) 23. Mask ROM 22a (which 
is so named because it is non-programmable) provides storage of certain program 
instructions and data, as will be described below. Flash memory 22b is a 
conventional flash memory device that can be written to (programmed) and erased 
electronically. Flash memory 22b provides storage of the browser software as 
well as data. In one embodiment, a mass storage device 28 is included in the box 
10 and coupled to ASIC 20. The n&ss storage device 28 itiay be used to input 
software or data to the client or to download software of data received over 
network connection 29. The mass storage device 28 includes any suitable medium 
for storing machine-executable instructions, such as magnetic disks, optical disks, 
and the like. 

As mentioned above, the WebTV box 10 includes application software 
including a Web browser.. Referring now to figure 2A, the above-mentioned 
application software 31 operates in conjunction with operating system (OS) 
software 32, The OS software 32 includes various device drivers and otherwise 
provides an interface between the a^jplication software 31 and the system hardware 
co^^x)nents 40 (i.c., the elements illustrated in Figure 1C5. 

In one embodiment^ the application software 31 and the OS software 32 
are stored in flash memory 22b. It will be recognized, however, that some or all 
of dtiier the application software 3 1 or the OS software 32 or both can be stored in 
any other suitable storage medium, such as mask ROM 22a or mass storage device 
28, in various embodiments. 

As mentioned above, steps according to the present inventicm are embodied 
in niachine^ecutable instructions according to one oiibodiirient Forexanq)le,in 
one embodiment the preset invention is carried out by the CPU 21 executing 
sequences of instructions contained in mask ROM 22a, flash memory 22b, or 
RAM 23, or a combination of these devices. More specifically, execution of the 
sequences of instructions causes the CPU 21 to perform the steps of the present 
invention. These steps will be described below. Instructions for carrying out the 
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prescnt invention may be loaded into memory from a persistent store, such as 
mass storage device 28. and/or from (Mie or more other computer systems over a 
network, such as the WebTV server 5 or a remote server 4. For example, such a 
server system may oransmit a sequence of instructions to the client system 1 in 
response to a message transmitted to the server system over the Internet 3 by the 
client system L As the client system 1 receives the instructions via a network 
connection, such as modem 27a, the client system 1 stores the instructions in a 
memory. The client system 1 may store the instructions for later execution or 
execute the instructions as they arrive over the network connection. 

In some embodiments, the downloaded instructions may be directiy 
supported by the CPU 21. Consequentiy, execution of the instructions may be 
performed directiy by the CPU 21. In other embodiments, the instructions may 
not be directiy executable by the CPU 21. Under tiiese circumstances, the 
instructions may be executed by causing tiie CPU 21 to eixecute an interpreter that 
interprets the instructions, or by causing the CPU 21 to execute instructions which 
convert the received insuuctions into instructions that can be directiy executed by 
die CPU 21. 

Certain embodinients and aspects of the present invention may be also 
carried out in tiie WebTV server 5, instead of (or in addition to) being carried out 
in the WebTV client system 1. For example, a CPU of die server 5 may execute 
instmctions stored in memory to poform steps in accordance with the present 
invention. 

In various embodimwits, hardwired cncuitry may be used in place of, or in 
combination with, software instructions to implement die present invention. Thus, 
the present invention is not limited to any specific combination of hardware 
circuitry and software, nor to any particular source for the instructions executed by 
a computer system. 

The Web browser fiinctions of the present invention are implemented by 
application software 31. Figure 2B illustrates the functional relationships between 
the various components of application software 31. Application software 31 
includes a commimications manager 50, an HTML (Hypertext Mark-up Language) 
parser 51 , a graphics engine 52, a layout engine 53, an input system 54, a user 
interface 56, a selection handler 55, and a streams manager 57. The client system 
receives Web pages over network connection 29 in the form of HTML documents. 
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Streams manager 57 controls and coordinates movement of data throughout the 
system. 

Each HTML document is initially input to communications manager 50. 
Communications manager 50 perfonns functions necessary to receive an HTML 
document, aver the netwoik connection 29 including handling high-level 
communications protocols, such as HTTP (Hypertext Transport Protocol), 
TCP/IP (Transmission Control ProtocoVIntemet Protocol), etc. A received HTML 
document is provided by communications manager 50 to HTML parser 51 . 
HTML parser 51 converts HTML page descriptions to a displayable format for the 
client system. 

As is well known, an HTML document may contain text and/or refer to 
images and other media types. Further, an HTML document may include one or 
more "hypertext anchors," which are displayable objects that provide a link to 
another Web page. Hypertext anchors are also sometimes referred to as "hot 
links" or "hot spots." Each hypertext anchor is associated with a particular URL 
(Uniforai Resource Locator) or other logical address representing the location of 
the Web page to whidi the anchor provides a link. The URL may or may not be 
stored within the client system, as will be discussed bdow. A hypertext anchor 
may be a word or a phrase of text, an images or a part of -an image. The hypertext 
anchors and other objects wMdi are to be displayed on a screen are defined in the 
HTML document by a number of "tags" which specify information relating to the 
displayable object and its attributes (e.g., whether the object is a hypertext anchcR-, 
an image map, an image map containing hypotext anchors, etc.):^ Acccndingly, 
HTML parser 5 1 separates tags fiom text wi^ each HTML docimient under the 
control of the streams manager 57. 

HTML parser 51 also generates a list of "displayables" (displayable 
objects) ham the received HTML document In addition, the HTML parser 51 
generates a sorted list of "selectables" (selectable objects, ie.. displayable objects 
which are hypertext anchors). The method of generating this sorted list is 
discussed below in greater detail The data structure containing the list of 
displayables is provided by HTML parser 5 1 to layout engine 53, which places the 
displayable objects on the screen of the television set 12. Image information, such 
as JI^G or GIF image infortnation, are provided by HTML parser 51 to layout 
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engine 53 via a graphics engine 52, which is responsible for measuring and 
drawing images. 

In the course of browsing through a displayed Web page or between 
different Web pages, a user activates buttons on remote control 1 1, which, in 
response, transmits IR signals that are received by the box 10. Input system 54 
records these inputs and provides them to selection handler 55 and user interface 
56. In addition, input system 54 can handle input queuing and processing of 
inputs from various other input devices, such as a standard or IR keyboard 
Selection handler 55 receives commands provided by the user for moving a 
selection (of a hypertext anchor) around on the screen (i.e., moving a selection 
from one hypertext anchor to another). The selection handler 55 determines, in 
response to the user inputs, which of the hypertext anchors in a Web page should 
be a current selection and provides a selection indication to the user of the current 
selection through the display of television set 12. Otiier aspects of die Web 
browser which are visible to the user are provided by user interface 56 based on 
signals received from input system 54. 

Figure 3 illustrates remote control 11 in greater detail. In the preferred 
embodiment, remote control 11 is similar to, and has the general appearance of, a 
common, hand-held remote control that is used to control a television set In fact, 
remote control 1 1 may include controls for performing ahy or all of the user- 
controlled functions of television set 12. In an alternative embodiment, remote 
control 1 1 may be a wired hand-held controller or a wired or wireless (e.g., IR) 
keyboard or any other similar device. Remote control 1 1 includes Options button 
7, Scroll Up button 8, Scroll Down button 9, direction buttons 15-18, Enter 
button 19, Home button 35, Back button 36, and Recent button 37. Ejection 
buttons 15-18 include left button 15, right button 16, up button 17, and down 
button 18. 

The functions of the controls on remote control 1 1 will be described below 
in detail. However, a summary of certain ones of these functions is helpful at this 
point Home button 35 causes the client system 1 to directly access and display 
the Home page of the WebTV network services. Back button 3 allows the user to 
retrace his steps by revisiting previously-visited Web pages in reverse 
chronological order. Recent button 37 is used to access a display indicating the 
most recent Web sites (pages) the user has visited. From this display, the user 




wo 98/22889 



PCTAJS97/21171 



-11- 

will be allowed to directly revisit any of those sites. Scroll Up button 8 and Scroll 
Down button 9 are used.to scroll the screen up or down, respectively, to display a 
previously undisplayed portion of a Web page. Direction buttons. 15-18 allow the 
user to specify a direction of movement in order to select a new hypertext anchor. 
When Bnta button 19 is pressed, remote control 1 1 sends a command to the box 
10 to activate the hyp^text link conesponding to the cunendy selected hypertext 
anchor. Hiat is, in response to Enter button 19 being pressed, die uniform - 
resource locator (URL) or other logical address of the currendy selected hypertext 
object is transmitted by the box 10 over die network connection 29Xexcq)t in the 
case of certain image maps, as will be described below) in order to access the 
corresponding Web page. 

n. Hotlink Navigation 

Kgure 4A illustrates an cxan^le of a Web page 100 which may be 
displayed by television 12 using die client system. Web page 100. is a sanq)le 
home page for a fictitious company. Company XYZ. Web page 100 includes a 
number of hypertext anchors 101-1 15, each of which provides a link to another 
Web page that is not currendy displayed. In Web page 100, hypertext anchors 
101. 102, 109, 1 10, 1 1 1, and 115 are image maps. Hypertext anchors 101 and 
102 are mcluded within image map 116, which is not a hypertext anchor. 
Hypertext anchor 101 is the logo for Company XY21 Logo anchor 101 may 
provide a link, for exaicqple, to another Web page v^ch provides a history of 
coo^any XYZ. Hypertext anchor 115 may provide a link, for example, to an 
audio stream which generates the jingle (theme song) of Company XYZ, and 
which can be ou^ut to the user through the speaker of television IZ The 
remainderof the Web page 100 ooritaiiis iiiforinational text about Con^>any XYZ 
grouped under the headings: "Product Information," "Cjompany News," 
"Personnel Profiles," "Sales Offices," and "Job Opportunities." Hypertext 
anchors 103-108 and 1 12-1 14 are key words or phrases within the text 

Hie ^plication software 3 1 provides means fcr the user to navigate 
between the hypertext anchors displayed on a Web page, such as Web page 1(X), 
using the remote control 1 1. In particular, application software 31, in response to 
user inputs from remote control 11, moves a selection from one hypertext object to 
another in a discrete movem^ At a given point in time, <nily one displayed 
hypertext c^ject is selected and is displayed as a current selection. In Ingme 4A, 
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for example, hypertext anchcw 101 is selected and is indicated as the current 
selection by highlighting 118. Once a hypertext anchor is selected, the user can 
activate its link to another Web page by pressing "Enter" button 19. To change the 
current selection, the user specifies a direction in which he wishes to "move" the 
selection by pressing any of direction buttons 15-18 on remote control 1 1. In 
response to one of these inputs, the ^plication software 31 determines which of 
the other hypertext objects in the HTML document, if any, to select and display as 
the new cxmient selection. In prior systems, the user is required to move a pixel- 
accurate cursor over the screen in continuous movements and to determine when 
the curisor coincides with a hypertext anchor. In contrast, the present invention 
determines the location of all hypertext objects and selects a new hypertext anchor 
based on the inputs received from remote control 1 1. The user is not required to 
locate hypertext objects. 

Assume now that the user wishes to change the current selection, hypertext 
anchor 101, to a new hypertext anchor - anchor 104, for example. The present 
invention allows the user to do this by pressing one of direction buttons 15-19 on 
remote control 1 1. Hgurc 4B illustrates tiie Web page 100 as displayed with the 
anchor 104 as the current selection, as indicated by highlighting 1 18. Using 
remote control 1 1, the user may move the current selection successively through 
all of the displayed hypertext anchors and may also access and select hypertext 
anchors in the cmrcnx HTML document which arc not cumendy displayed by 
scrolling the documwit 

Figure 5 illustrates a routine for setting up a data structure performed by 
the ^plication software 3 1 in order to permit die functions just described. 
Initially, an HTML document is received via die network connection 29 (step 
501). Next, the HTML document is parsed into text and tags (step 502). From 
the parsed HTML document, a list is generated of the displayables in the current 
HTML document (step 503). Displayables may include either text or images. 
Methods of identifying displayables in an HTML document are well known and 
need not be discussed herein. From the list of displayables. a sorted list of 
selectables (e.g., hypertext anchors) is generated (step 504). 

The list of selectables is sorted as it is generated according to an insertion 
sort routine. The selectables are sorted according to their positioning on the Web 
page, as it is to be displayed. More specifically, die list of selectables is formed 
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according to the direction of reading or writing in the English language (Le., left to 
right, top to bottom). Sorting is performed first according to vertical position, and 
then according to horizcHital position. Acccidingiy, a selectable would be inserted 
into the selectable list at a position before any selectables which it appears higher 
than on the Web page; a selectable would also be inserted into the selectable list at 
a position before any selectables that it appears to the left of on the Web page but 
have the same vertical position. 

In performing the insertion sort to generate the list of selectables, the 
determination of vertical positioning requires a determination of whether a given 
selectable falls "above" or rbelow" an existing enory in a selectabk list This 
determination can be con:q)licated by the fact that there ispften overly in the 
vertical direction between two selectables. In Figures 4A and 4B, for example, 
there is vertical overlap between anchors 109 and 1 10. The present invention 
resolves this difficulty in the following way: If there is overlap between a given 
selectable and another selectable, then the given selectable is considered to be 
"below** the other selectable if the overlap between the two selectables is less than 
one-half of the height of die smaller of the two selectables and the top of the given 
selectable is below the top of the other selectable. If the overlap is not less than 
one-half of the height of the smaller of die two selectables or if the top the given 
selectable is not below the top of the other selectable, then the given selectable is 
not considered to be ''below" the other selectable. This same criterion is sailed to 
determine when a given selectable can be ccmsidered "above" another selectable. 
Further, this noethod of defining the conditions "above" and /'below" is also used 
in moving the current selection in response to a user input, as will be discussed 
below. 

Referring now to Hgures 4A and 4B for exan^le, the sorted list of 
selectables generated from Web page 100 would have the following order (listed 
by reference numeral): 101, 102, 104, 103, 105, 106, 107, 108, 109, 110, 112. 
1 1 1 , 113, 1 14, 1 15. It should be appreciated that in the actual inq)lemratadon,.the. 
sorted list of selectables may actually constitute a list of pointers to entries in the 
list of displayables. Note that in reading Web page 100 from top to bottom (in 
English), a person would encounter anchor 104 before encountering anchor 103, 
since andiar 104 is in a line of text that is above die line of text including andKr 
103. Consequently, anchor 104 q)pear5 before anchor 103 in the sorted list of 
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selectables. Note further that anchor 1 12 appears before anchor 1 1 1 in the sorted 
list of selectables; this is so because there is overlap in the vertical direction 
between these two anchors, and the anK>unt of overlap includes the entire height of 
anchor 112. Consequendy, anchors 111 and 1 12 are considered to have the same 
vertical rank on Web page 100. However, since anchor 1 12 appears to the left of 
anchor 111, anchor 1 12 is inserted into the sorted list before anchor 1 1 L 

TTie method of moving a selection between hypertext anchors using remote 
control 1 1 will now be described with reference to Figures 6, 7A, and 7B. As 
mentioned above, a user may move the current selection left or right by one 
hypertext anchor by pressing left button 15 or right button 16, respectively. In 
response to receiving an IR signal corresponding to either button 15 or button 16, 
the software 31 of the preseint invention (specifically, selection handler 55) simply 
moves one position through the sorted list to detemune the new current selection. 
Hiat is, if left button 15 was pressed, the new current selection will be one 
position higher in the sorted list from the current selection, whereas, if right button 
16 was pressed, the new current selection wiD be one positioii lower in the sorted 
list from the current selection. In response to any of direction buttons 15-18 being 
pressed, a new current selection will be determined and the highlighting 118 will 
be moved to indicate the new current selection. The screen may be scrolled to 
display the current selection, as discussed below. As mentioned above, the user 
may activate the hypertext link of the current selection by pressing "Enter" button 
19. 

Movement of the current selection in response to down button 18 being 
pressed will now be described with reference to Rgure 7A. When down button 
18 is pressed, each entry (selectable) that appears in die sorted list after the current 
selection is examined according to the following routine to determine whether it is 
a Best Guess. Hie Best Guess is a selectable which is, at least tcmporaiily, 
considCTcd to be the most likely selectable to be the new current selection based on 
die last user input As each entry in the list is tested, whichever entry is currendy 
considered to be the Best Guess may then be replaced by the selectable being 
tested if the selectable being tested is determined to be a better choice. Once the 
end of the list is reached, die selectable diat is designated as the Best Guess will be 
taken to be the new current selection. Accordingly, highlighting 118 will be 
moved to indicate the new current selection to the user. 
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In response to the down button 18 being pressed, the ^plication software 
31 initially d^emunes which selectable in the sorted list of selectables is the next 
selectable after the cuirent selectable (ie., after the current selection) (step 701); 
the result of this determination will be referred to herein as "selectablej". Next, 
the detennination is made of whether selectable J is cunendy displayed on the 
screen (step 702). If not, then the screen is scrolled down by a predetermined 
number of pixels, but less than an entire screen (step 714). An exemplary number 
of pixels is 100, although that number is essentially arbitrary. After the screen is 
scrolled down by the predetermined number of pixels, another determination is 
made of whether selectablej is displayed on the screen (step 715) If the 
selectable_l is still not displayed on the screen, then the current selection is not 
changed, and the highlighting 1 18 is not moved (step 713). Note that if the 
current selection is no longer visible after the scroll, no selecticm is displayed. 

If, however, after scrolling down by the predetermined number of pixels 
(step 714), selectable_I is displayed on screen, or if selectable.I was displayed 
initially (step 702), then a determination is next made of whether the vertical 
overlap between the current selection and selectable J is less than one-half of the 
height of die smaller of the current selection and selectable _J (step 703). If so, a 
determination is made of whether the top of selectable.1 is below die top of the 
current selection (step 704): Note that die order of diese two determinations (steps 
703 and 704) can be reversed, if desired. Note also that tiiese two determinations 
wmstitute die same test used to detemune die "below" condition when generating 
die sorted list of selectables. 

If die answer to ddier of these two detenounations is "no," then the process 
proceeds to a determination of whedier selectable_I is die last selectable in the 
sorted list (step 711). If selectablej is not die last selectable in die list, dien die 
procedure of Hgures 7 A and 7B is rq>eated, taking die next selectable in die list as 
selectablej (steps 718, 701). Alternatively, if selectablej is die last selectable in 
the Hst, dioi die new current selection is taken to be whichever selectable is 
currnidy designated as the Best Guess, as explained below (stq> 712). 

If (in step 703) the vertical overlap between die current selection and 
selectablej is less dian one-half of die height of die smaller of die current selection 
and selectable.I, and Qn step 704) die top of selectablej is below the top of die 
cunem selection (st^ 704), dien it is next determined whedier there is currendy a 
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Best Guess (step 705). If there is not yet a Best Guess, then the Best Guess is 
taken to be selectablej (step 710). Again, if selectable.! is not the last selectable 
in the list, then the process repeats ton the beginning with the next entiy in the 
list If ( in step 705) there was aheady a Best Guess, then a determination is made 
of whether the top edge of selectable_I has a higher vertical position (smaller "Y" 
coordinate) than die bottom edge of die Best Guess selectable (step 706). If not, 
dien selectablej is disregarded as a potential new Best Guess, since it is lower on 
the scre^ than the current Best Guess. The reason for disregarding selectable.! in 
this case is an assumption that the user would not wish to jump from the current 
selection to selectablej if tiiere is another selectable that is below the current 
selection but above selectablej. Accordingly, in that situation a determination is 
again made as to whether selectablej is the last selectable in die list (step 711). If 
not, die process repeats from die beginning (steps 7 18, 701) using die next entry 
in the list 

If (in step 706) die top of selectablej is higher dian die bottom of die Best 
Guess, then it is determined how much overlap. Guess Overlap, exists (if any) 
between die Best Guess and die current selection (step 707) in die horizontal 
direction. Similarly, a determination is made of how much horizontal overlap, 
Selectable Overlap, exists (if any) between selectablej and die current selecticm 
(step 708). If Guess Overiap is less ±an Selectable Overlap (step 709). dien die 
Best Guess is replaced by selectablej (step 710); diat is. selectablej is then 
considered to be die new Best Guess. Accordingly, die process repeats using die 
next entry in die sorted list of selectables, assuming selectablej was not die last 
selectable on the list 

If (in step 709) Guess Overlap is not less dian Selectable Overlap, dien two 
situations might exist The first situation is diat die overiap anrounts arc equal 
(i.e.. Guess Overly = Selectable Overly). The second is diat Selectable Overlap 
is greater dian Guess Overlap. Generally, the amount of horizontal overiap will be 
equal when Guess Overlap and Selectable Overlap arc both zero Oc when 
neidier selectablej nor die Best Guess overl^s die current selection horizontally). 
If Guess Overlap and Selectable Overlap are equal, then selectablej is taken to be 
die new Best Guess (in step 710) only if the left edge of die Best Guess is fardier 
ftom die left edge of die current selection than die left edge of selectablej is ftom 
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the left edge of the current selection (step 717). Otherwise, the process repeats 
using the next selectable in the list 

If Guess Overly is not less than Selectable Overlap and either (1) the 
Guess Overlap does not equal Selectable Overlap, or (2) the left edge of Best 
Guess is closer to the left edge of the cmrcnt selection than the left edge <tf 
selectablej is to the left edge of the current selection (steps 716 and 717). thai the 
Best Guess is retained, and the process repeats (unless selectable.! was the last 
selectable in the list). 

Once the last selectable in the list has been tested using the process just 
described, then the new current selection is taken to be whichever selectable is 
cumendy the Best Guess (step 712). The new current selection is then highlighted 
(step 713). 

The procedure of Hgure 7A will now be explained with lefCTence to Rgure 
6. Bgure 6 shows a number of hypertext anchors 131-136 and defines certain 
spatial relationships between those anchors. A scnrted selectable list of the anchors 
in Hgure 6 would have the following order 132, 131, 133, 134, 135. 136. 
Assume now that the current selection is anchor 131 as indicated by highlighting 
118. Assurne further tfiat the user has just entered an iiq)ut by pressing the down 
button 18 on renoote control 1 1. In response, the s^lication software 31 initiates 
the process of examining entries in the sorted selectable list which follow 
selectable 131 (Lc.. beginning with selectable 133). Note that selectable 132 
appears before 131 in the sorted list because it is "above" selectable 131 (according 
to the criteria described above) and is therefore not considered in^response to the 
down button 18 being pressed Therefore, according to the routine of figure 7 A, 
the anoount of vertical overly between selectable 133 and selectable 131, if any, is 
determined. In this example, the amount of overlap is four units. The dimensions 
illustrated in Hgure 6 are provided only for purposes of illustration ainl can have 
essentially any units, such as pixels, for exanqsle. Since the amount of vertical 
oveciap (four units) between selectable 133 and selectable 131 is greater than one- 
half of the height of the smaller of these two selectables (one-half of the height of 
selectable 133, v^^ch is 6/2, or 3), selectable 133 is not considered to be ^'below** 
selectable 131. Consequently, since tfaoe is cuirendy no Best Guess, selectable 
133 is ignored and the next selectable in the Hst, selectable 134, is examined. (If 
there were cunendy a Best Guess, thai that Best Guess would be retained.) 
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Ck)ntimiing now through the sorted list of selectables, selectable 134 
overlaps selectable 131 in the vertical direction by two units. Two units is less 
than one-half of the height of the smaller of these two selectables (which are equal 
in height). Further, the top edge of selectable 133 is below the top edge of 
selectable 131. Therefore, selectable 134 is considered to be "below" selectable 
13 1, and selectable 134 is taken to be the new Best Guess (step 710). The next 
selectable to be examined is selectable 135. Selectable 135 does not overlap 
selectable 131 and is, theiieforc, considered to be below the current selection. 
However, the top edge of selectable 135 is below the bottom edge of the Best 
Guess, selectable 134. Therefore, selectable 135 is not a "better" Best Guess than 
the current Best Guess, selectable 134. Consequendy, selectable 134 is retained 
as the Best Guess and the next selectable in the list, selectable 136 is examined 
Essentially the same analysis applies to selectable 136 as to selectable 135. 
TTierefore, when the end of the sorted list is reached, die Best Guess remains 
selectable 134. Consequently, it is determined that the new currait selection is 
selectable 134. Accordingly, selectable 134 is highlighted to so indicate. 

In a different scenario, assume now that the current selection is selectable 
132 and that down button 18 is pressed. The next selectable in die sorted list of 
selectables after selectable 132 is selectable 131. Therefore, selectable 131 is the 
first selectable to be examined. Selectable 131 does not overlap selectable 132 by 
more than four units {1/2 die height of selectable 131). Further, die top edge of 
selectable 131 is below the top edge of selectable 132. Therefore, selectable 131 is 
considered to be "below" selectable 132 and is taken to be die Best Guess. 
Moving dirough the sorted list, the next selectable to be examined is selectable 
133. Selectable 133 meets die conditions for being "below" die current selection, 
selectable 132. Moreover, die top edge of selectable of 133 is not below die 
bottom edge of die current Best Guess, selectable 131. Therefore, selectable 133 
is. so far, not eliminated from consideration. Instead, the amount of hoTi2X)ntal 
overly is deteraMned between selectable 133 and die current selection selectable 
132 and between selectable 131 and die current selection. In diis case, both 
selectable 131 and selectable 133 have no horizontal overlap widi die current 
selection, selectable 132. Therefore, die next step is to determine (step 717) 
whetfier die left edge of die current selection is closer to die left edge of selectable 
131 or to die left edge of selectable 133. Here, die left edge of 131 isclosCT. 
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Therefore, selectable 131 is retained as the Best Guess (step 710). The above 
process then continues by examining the next selectable in the sortBd list, 
selectable 1 34, and so forth, undl the end of the sorted list is reached. At that 
point, the Best Guess is taken as the new current selection. 

In yet another scenario, assume that the cuirent selection is selectable 133 
and that down button 18 was just pressed. Selectable 134 is the first selectable in 
the list to be examined. The overlq) between selectable 134 and 133 is greater 
than one-half the height of the smalls of these two selectables (i.e., greater than 
6/2, or 3). Therefore, selectable 134 is eliminated from consideration. At this 
point there is no Best Guess. Selectable 135 is considered next. Selectable 135 
does not veitically overiap the cunent selection, selectable 133, and the top of 
selectable 135 is below the top of the current selection. Therefore, selectable 135 
is taken to be the Best Guess. Next, selectable 136 is considered. Selectable 136 
does not overi^ the curroit selection, nor is the top edge of the selectable 136 
below the bottom edge of selectable 135. Therefore, the horizontal overiq). 
between the Best Guess and the current selection (Gue^sOverlap), and between 
selectable 136 and the current selection (Selectable Ovedap), are determined. The 
Guess Overiap is zero. However, Selectable Overlsq) is four units. Therefore, 
because Selectable Overlap is greater than Guess Overiap, selectable 136 is taken 
to be the new Best Guess. Further, since selectable 136 is the last selectable in the 
sorted list, the new current selection is taken to be the current Best Guess, 
selectable 136. 

The procedure of changing the selection in response to up button 17 being 
pressed is illustrated in Figures 7C and 7D. The procedure for up button 17 is 
sutetantially identical to the procedure for down button 1 8 (Rgures 7A and 7B) 
and therefore need be not be described on a step-by-step basis. Nonetheless, 
certain diffmnces should be noted. In response to the up button 17, in contrast 
with die procedure for down button 18: (1) entries in the list of selectables are 
examined nooving backward through the list (not forward) starting with the 
selectable immediately preceding tiie cunent selection (steps 751, 768), until all 
selectables in the list preceding the current selection have been examined; (2) the 
amcxmt of vertical overlap is calculated in order to determine whether a selectable, 
selectable J, is "above" the cunent selection, not "below" (step 754); (3) 
selectable_I is ignored as a possible new current selection if the bottom of 
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selectablcj is above the top of the current Best Guess (step 756); and, (4) if 
selectablej is not currently displayed, the screen is scrolled up (not down) by a 
predetermined nunaber (e.g.. 100) of pixels. 

lU. Audio Functions 

The application software 31 includes the capability to cause music and 
other sounds to be played through a speaker of the television set 12 based on audio 
data downloaded via the network connection 29. The downloaded audio data may 
be in any of a number of different formats, such as MIDI (Musical Instrument 
Digital Interface), MPEG (Moving Picture Experts Group), Real Audio, .AU, 
.AIFF, .WAV or others. Once decoded by the box 10, the received audio 
information is provided to the television set 12 via an audio channel of link 6. 
A. Background Music 

The WebTV service provides audio data to the box 10 for the purpose of 
providing background music or other sound effects (hereinafter simply 
"background music") to the user while the user is browsing the Web. The 
background music fills the audio channel of link 6, which would otherwise be 
unused, unless the user has activated a link to an audio file or a Web page provides 
its own background music. Hence, the background music provided by the 
WebTV service makes the Web browsing experience seem more like watching 
television to the user, since people are accustomed to hearing sound while viewing 
a television. In addition, background music is loaded into the cliait system 
asynchronously witii, and independent of. the user's actions, so that die user does 
not have to wait to hear audio. 

Audio data for generating background music may be in one of the above- 
mentioned audio formats. The background music may be downloaded into RAM 
23 and played inmiediately or simply retained for later use. Alternatively, 
downloaded audio files can be written into flash meniory 22b. Background music 
played to die user may be generated from audio files that have been previously 
stored in memory, or from audio files that are being downloaded from the servo: 5 
"on the fly", or a combination of tiiese techniques. 

The background music is generally played to the user while the user is 
viewing Web pages and while Web pages are being downloaded. In one 
embodiment, which is illustrated by Figure 1 1 A, die file for generating one 
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sequcncc of background music is downloaded from the server 5 (step 1 102) only 
after the previous sequence of background music has finished playing (step 1 101). 
This embodiment reduces memory consunq>tion in the client system 1 in 
con^arison to a second onbodiment, which is illustrated in Hgure 1 IB. In the 
embodiment of Ingure 1 IB, a ctment sequence of background music is played to 
the user (step 1 101) while the file for generating the next sequence is being 
downloaded from the server 5 (step 1 102). Thus, the embodiment of Figure 1 IB 
has the advantage of providing the user with a continuous, unintenrupted stream of 
background music. 

The WebTV background music is suspended, however; whenever a 
displayed Web page provides its own background music, such that the 
background music associated with the Web page is played instead. Once the user 
leaves that Web page, the WebTV background music is resumed In addition, if 
the user has accessed a particular hypertext link to an audio file, the WebTV 
background music is suspended while the accessed audio file is played. 

In addition, the ^plication software 31 also pnovides certain special sound 
effects. Tlie special sound effects are superimposed upon.(i.e., played 
concunendy with) the background music. Some of the sound effects are used to 
provide the user with audio feedback when operating the remote control 11. For 
example, a specific sound effea is associated with each function of die remote 
control 1 1 . Audio feedback such as this is advantageous, since the user of the 
present invention will likely be positioned farther from the display than he would 
be if he were using an ordinary personal con^uter.to browse the Web. 

B. Audio Panel 

The present invention includes a user interface for controlling audio 
functions related to audio files downbaded from the Web. This user intQ:£ace is 
presented to the user in the form of an "Audio panel", which is illustrated in Hgure 
8B. The audio panel is accessed by the user's pressing the OpUfms button 7 on 
the remote control 1 1. when audio is available. In general, pressing the Options 
button 7 causes an Options panel to be displayed, as illustrated in Figure 8A. The 
Options panel 140 is displayed as an overlay on a currendy displayed Web page 
100. The Options panel 140 provides the user with access to a number of 
managerial functicms of the WebTV browser which are accessed by soft buttcms 
148-152. Buttons 148-152 are selected and activated by the use of the remote 
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control 1 1 (more specifically, buttons 15-19 of remote control 11) in the manner 
described above. 

If the user activates the Options panel 140 while downloaded audio data is 
being played (in response to activating a hypertext link to an audio ftle)» the 
Options panel will be displayed with an Audio panel 154, as illustrated in Figure 
8B. If the last hypertext link to be accessed did not link to audio data, then the 
Options panel 140 is displayed without the Audio panel 154, as illustrated in 
Figure 8 A. That is, the Audio panel 154 is enabled to be displayed only when 
there is audio data to be played. Thus, Figure 9 illustrates a routine for displaying 
an Options panel In response to die Options button 7 bding pressed, if the Audio 
panel is enabled (step 901), then the Options panel is displayed with the Audio 
panel (step 902). If the Audio panel is not enabled (step 901), then the Options 
panel is displayed without the Audio panel (step 903). 

The Audio panel 154 includes a tide bar 142, a Stop button 143, a Play 
button 144, a Pause button 145, a Fast Forward button 146, a Rewind button 147, 
and an audio scope 153. As indicated by thdr names, buttons 143-147 are for 
controlling audio data and audio sequences (sound) graerated fiom audio data 
(e.g.., playing an audio sequence, rewinding a audio sequence, etc.). The button 
that is currentiy selected at any point in time is indicated by highlighting 118. 
Buttons 143-147 are selected and activated by the use of the remote control 1 1 
(i.e., buttons 15-19 of remote control 1 1) in the manner described above. The tide 
bar displays the tide name of the audio file currendy being played. 

The audio scope 153 is a display of moving audio waveforms representing 
the audio sequence being played are displayed. In the case of stereo data, two 
waveforms are displayed, as illustrated in Figure 8B, one waveform for the left 
audio channel and one waveform for the right audio channel. The wavefomM are 
generated by converting the digital audio data into a graphical plot Hie 
waveforms are scrolled horizontally across the audio scope 153 as the audio 
sequence is played, so diat die sound die user is hearing at any point in time is 
represented by the waveforms currentiy displayed in die audio scope 153. 

Rgure 10 illustrates a routine for enabling or disabling display of die 
Audio panel when the user activates a hypertext anchor on a Web page. Initially, 
the Web site specified by die anchor's URL is accessed by die client system 1 
(step 1001). The accessed file is then retrieved and a status indication is displayed 
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on the screen during downloading to indicate how much of the file has been 
downloaded so far (step 1 102). A downloaded HTML file will generally contain a 
MIME (Multi-purpose Internet Mail Extension) header, which specifies the content 
type of the file. Thus, after downloading the file, the client system 1 detemiines if 
die file contains a MIME header (step 1004). If the file contains a MIME header, 
the header is exannined to determine the content type of die file (stq) 1005). If the 
file contains audio content, dien die Audio panel is enabled (step 1006) and the 
audio file is played (step 1007). If die file does not contain audio content, then 
display of the audio panel is disabled (step 1009), and the retrieved file is 
displayed normaUy (step 1010). ' 

If (in step 1004) the file does not contain a MIME header, the client system 
1 determines the content type by examining the file extension of the downloaded 
file, if any. For example, die file extensions .AU, .WAV, and . AIFF indicate 
audio content After d^ennining the content type from the file extension, the 
routine proceeds as described above (from step 1005). 

The client system 1 has the ability to determine whether particular a audio 
function, such as Rewind, Pause, or Fast Forward, applies to a given downloaded 
file. For example, the Fast Forward function (button 146) docs not apply to a live 
audio stream, such as Real Audio. Therefore, for situations in which a particular 
audio function does not ^ly, the corresponding button is automatically disabled 
(Le., not displayed or di^layed in a manner that indicates the function is 
unavailable). 

Id one embodiment, the user can save into memcny an audio file that has 
been accessed fiom a Web site. For exanq>le, the user can save such an audio file 
as a background music file, so that the file is automatically played as pan of a 
repeating sequence of background music selections. A downloaded audio file-may 
be saved into memory using die Save button 151 displayed on die Options panel 

Hius, a method and syjparanis for providing a user interface to control 
audio functions in a Web teowser have been described Although die present 
invention has been described with referraice to specific exemplary embodiments, it 
will be evidoit diat various modifications and changes may be made to diese 
embodiments without departing from the broader spirit and scope of the invention 
as set forth in the claims. Accordingly, the specification and drawings are to be 
regarded in an iDusuative rather than a restrictive sense 
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CLAIMS 

What is claimed is: 

1 . In a client processing system capable of being coupled to a server 
processing system over a network and capable of being coupled to an audio output 
device, a method of processing audio infomiation, the method comprising the 
steps of: 

outputting a first audio sequence via the audio output device based on a 
first audio file stored in the client processing system; and 

downloading a second audio file into the client processing system while the 
first audio sequence is being output, wherein the second audio file is transmitted 
from the server processing system to the client processing system over the 
network, the second audio file representing a second audio sequence to be output 
following the outputting of the first audio sequence. 

2 . A method according to claim 1 . furtha* comprising the step of enabling a 
user to navigate through an interactive display environment including a Worid 
Wide Web page while ou^utting the first audio sequence. 

3 . A method according to claim 1 , wherein the audio ou^ut device is a 
conqxMient of a television set 

4. ri V A metiiod according to claim 3, the method further con^rising the stqi of 
enabling a user to browse the Worid Wide Web using the television set as a 
display device. 

5 . A method according to claim 1 , wherein the client system is further enable 
of being coupled to a display device, the method further comprising the steps of: 

receiving a user input entered fixjm a remote input device; 

determining based on the user input a hypertext object that is to be selected, 
wherein the object is one of a plurality of hypertext objects displayed on the 
display device; 

selecting said one of the objects based on the determining step; and 
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displaying on the display device a selection indicator identifying the 
selected hypertext object 

6 . A method according to claim 5, wherein the display device is a television 
set. 

7 . A client processing system capMc of being coupled to a server processing 
system over a netwcsk and enable of being coupled to a television set, the client 
processing system comprising: .. .. 

aprocessOT; 

a first memory unit coupled to the prxx^ssor, the first memory unit having 
stoned therein a first audio file representing a first audio sequence; and- 

a second memory unit coupled to the processor, the second memory unit 
having stored therein sequences of instructions which, when executed by the 
processor, cause: 

outputting of the first audio sequence via the television set; and 
downloading of a second audio file into thie fnst memory unit while 
the first audio sequence is played, wherein the second audio file transmitted fiom 
the server processing system over the network, wherein the second audio file 
represents a second audio sequence to be output via the television set following the 
outputting of the first audio sequence. 

8 . A method according to claim 7, wherein the second noemory unit further 
has instructions stored dierein which, when executed by the processor, enable a 
user of the client system to browse the World Wi6t Web using the television set 

9. A method according to daim 8, wherein tiie second menHHy unit further 
has instructions stored tiierein which, when executed by the processor, enable a 
user of the client system to browse die Wcnld Wide Web using a remote input 
device to input connxiands to die client processing system. 

10. A metiiod according to claim 7, wherein die second meouHy unit further 
has instructions stored therein which, when executed by the processor, cause: 

ii^tting of a user input entered from a ronote input device; 



• 
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determination based on the user input of a hypertext object that is to be 
selected, wherein the object is one of a plurality of hypertext objects displayed on 
the display device; 

selection of said one of the objects based on the determining step; and 
display on the display device of a selection indicator identifying the 
selected hypertext object 

11. In a client processing system coupled to a server processing system over a 
network and coupled to a display device, a method of providing a user interface 
for controlling audio functions of the client processing system, the method 
comprising the steps of: 

downloading audio information from the network; and 
causing an audio control panel to be displayed on the display device, the 
audio control panel including a plurality of images, wherein each of the images is 
for activating one of a plurality of audio functions for manipulating the 
downloaded audio information. 

12. A method according to claim 1 1 , further comprising the step of enabling 
the audio control panel to be activated in response to an audio Hie being 
downloaded from the network. 

13. A method accenting to claim 1 2, wherein the audio file is accessed over the 
network by a hypertext link. 

14. A method according to claim 12, further comprising the step of enabling 
the audio control panel to be activated only if an audio sequence is currently being 
output by the client processing system. 

15. A method according to claim 1 1, wherein the display device is a television 
display. 

16 A method according to claim 1 5, wherein the user input is entered from a 
remote input device. 
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17. A method according to claim 1 6» wherein the user input is entered from a 
remote input device by a user activating a control, the control for specifying a 
direction for moving a selection indicator displayed on the display device firom a 
previous position on the display device. 

18. A method according to claim 1 1, the method further comprising the steps 
of: 

receiving a second user input entered £tt>m a remote input device; 
detemnning based on the second user, input a one of the images that is to be 
selected; 

selecting said one of the objects based on the determining step; and 
displaying on the display device a selection indicator identifying the 
selected one of the images. 

19. A method according to claim 1 1 , wherein the client processing system is 
further coupled to an audio output device* the method further comprising the steps 
of: 

causing an audio sequence to be output from the audio output device based 
on the audio infommion; and 

causing a nu>ving waveform representing the amlio sequence to be 
displayed on the display device as the audio sequence is output 

20. A client processing system for communicating over a networic with at least 
one server processing system, the client processing system coupled to a television 
set, the client processing system conqnising: 

a processor, and 

a memory coupled to the processor, the memory having stored therein 
sequences of instructions which, when executed by the processor, cause die 
processor to perform the steps of: 

causing a first plurality of objects to be di^layed on the television set, 
wherein each of the first plurality of objects is a hypertext object corresponding to 
a World WTide Web page; 

recdving a first user input altered from a renx^ input device; 

selecting one of the first plurality of objects based on the first user input; 
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retrieving audio data corresponding to the selected one of the Hrst plurality 
of objects frcMn the server processing system; 

causing an audio control panel to be displayed on the television set, the 
audio control panel including a second plurality of objects, wherein each of the 
objects is for activating one of a plurality of audio functions, each of the audio 
functions for processing the audio data; and 

receiving a second user input entered finom the remote input device, the 
second user input for designating one of the second plurality of objects; and 

in response to the second user input, performing one of the audio 
functions. 

21. A method according to claim 20, whertin the step of performing one of the 
audio functions comprises the step of causing an audio sequence to be output from 
the television set based on the audio data. 

22. A method according to claim 21, wherein the st^ of causing an audio 
control panel to be displayed on the television set comprises the step of causing a 
moving waveform representing the audio sequence to be displayed on the 
television set as the audio sequence is output from the television set 

23. A method according to claim 20, wherein the step of causing an audio 
control panel to be displayed on the television set comprises the step of 
dctcniiining whether one of the audio functions can be perfoimed with respect to 
the audio data. 

24. A method according to claim 20, wherein the step of causing an audio 
control panel to be displayed on the television set comprises the step of disabling a 
display of one of the second plurality of objects if a ccnesponding one of the 
audio functions cannot be performed with respect to the audio data. 

25 . A method according to claim 20, wherein the step of causing an audio 
control panel to be displayed on the television set further conq)rises die step of 
determining whether the audio data corresponds to live audio. 
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26. A method according to claim 25, wherein the step of causing an audio 
control panel to be displayed on the television set further comprises the step of 
disabling a display of one of the second plurality of objects if the audio data is 
determined to conespond to live audio. 

27 . In a client processing system coupled to a server processing system over a 
netwoirk and coupled to a display device, a method of providing a user interface 
for conux>lling audio functions of the client processing system, the method 
con^msing the steps of: 

receiving a user input selecting a hypertext anchor displayed on the display 

device; 

in response to the user input, requesting data represented by the hypertext 
anchor from the server processing system; 

recdving data represented by the hypertext anchor from the server . 
processing system; 

determining whedier the data includes audio data; and 

enabling an audio control panel to be displayed on the display device based 
on the determining step, the audio control panel fonning a user interface for 
allowing a user to control a plurality of audio functions associated widi the audio 

Hfltfl 

28. A method according to claim 27, further conoprising the stq> of disabling 
display of the audio control panel if the data is not determined to include audio 
data. 

29. A method according to claim 27, wherein die step of enabling the audio 
control panel to be displayed on die display device based on die determining st^ 
ccHnprises die step of enabling the audio control panel to be displayed on the 
display device only if the data includes audio data. 

30. A method according to claim 27, wherein the step of enabling the audio 
control panel to be displayed on the display device based on the determining step 
comprises the step of enabling the audio control panel to be displayed on the 
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display device only if an audio sequence corresponding to the audio data is 
currently being output by the client processing system. 

31. A method according to claim 27» wherein the client processing system is 
further coupled to an audio output device, the method further comprising the steps 
of: 

causing an audio sequence representing the audio data to be output via the 
audio output device; and 

disabling display of the audio control panel upon completion of outputtuig 
the audio sequence. 

32. A method accotxiing to claim 27, wherein the audio control panel includes a 
plurality of displayable objects, each of the objects representing one of the audio 
functions. 

33 . A method according to claim 27, further comprising the step of 
determining whetiier each of the audio functions can be performed with respect to 
the data. 

34. A method according to claim 33. fiorther comprising the step of disabling a 
display of one of the objects if a corresponding one of the audio functions cannot 
be perfoimed with respect to the data. 

35. A method according to claim 27, further con^msing tiie st^ of 
determining whether the data represents live audio. 

36. A method according to claim 35, further con^3dsing the step of disabling a 
display of one of the objects if the data is determined to represent live audio. 
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